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(54) Method and apparatus for performing pre-emptive maintenance on operating equipment 



(57) A pre-emptive maintenance system for per- 
forming maintenance and process assurance on run- 
critical equipment employs an intrinsic health monitor 
that includes a number of sensors that are used with an 
operating equipment or machine to generate a set of 
intrinsic physical signatures that are products of the pri- 
mary performance characteristics of the operating 
equipment. These signatures are nonfunctional para- 
metrics that are consistent and reliable indicators of the 
normal operation of the equipment and statistical norms 



are set during an initial learning mode using neural 
processors and the outputs from the sensors. Thereaf- 
ter, the sensor outputs are monitored and analyzed 
using the statistical norms to predict the probabilities of 
future failures of the operating equipment. By using the 
intrinsic health monitor an advance in the field of equip- 
ment maintenance and process assurance is provided 
in the approach to maintenance being a pre-emptive 
procedure. 
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Description 

BACKGROUNH OF THE INVENTION 
Field of the Invention 

[0001] This invention relates generally to a method 
and apparatus for detecting impending failure of operat- 
ing equipment and, more particularly, to a method and 
apparatus for sensing equipment parameters and per- 
forming statistical and neural processing of the detected 
parameters for the purpose of prognosticating the future 
operational status of the equipment. 

Description of the Related Art 

[0002] The earliest attempts at maintenance were 
simply run to failure, that is. machinery or operating 
equipment would simply be run until a failure occurred 
and then the equipment was repaired and the relevant 
operation continued. Because the unexpected failure of 
machinery and the interruption of industrial processes 
can cause great economic loss as well as environmen- 
tal harm, other maintenance approaches have been 
found to be necessary For example, pumps, motors, 
transformers, pipes, and other machine devices 
involved in the processing, transport, and distribution of 
chemicals, petroleum, and waste often cause consider- 
able environmental damage and economic loss, as well 
as hazards to human safety, when an unexpected 
machine/device or process failure occurs. 
[0003] Thus, it is seen that all of these adverse effects 
have required the run to failure maintenance approach 
to evolve into one that can detect impending operating 
device failure prior to the actual occurrence of such a 
failure. The concepts of preventive maintenance as well 
as predictive maintenance are presently being used. 
Preventive maintenance is an approach that requires 
the engineer or process manager to make some 
assumptions, which are typically conservative, as to 
when failure is likely to occur. As a consequence of 
these assumptions, the equipment is maintained by 
means of rigid schedules based on elapsed time of 
operation. Typical of this approach are: 1 ) lubricate main 
bearings every 50 hours. 2) replace oir seals every 100 
hours, and so forth. Following this approach, the 
machines and operating equipment are maintained, 
repaired, or replaced prior to the time that such actions 
are actually necessary. Therefore, the preventive main- 
tenance approach, which is dependent on excessive 
record keeping, can be costly yet still not be a fool-proof 
way for preventing unanticipated failures. For example, 
there may still be latent failures based on errors of man- 
ufacture, errors in factoring the maintenance schedule 
or induced failures caused by misalignments and the 
like at the time the actual equipment was installed. 
[0004] On the other hand, predictive maintenance is a 
more current technique that relies on expert knowledge 



of specific machinery, which knowledge is derived 
through testing and experimental data. Predictive main- 
tenance is implemented as periodic inspections which 
may include oil samples, infra-red thermography or 

5 vibration monitoring in order to detect known undesira- 
ble patterns which may imply a maintenance action, for 
example, oil contains too much hydrocarbons; time to 
change it. The relevant inspections are performed using 
portable equipment and generally an analysis is con- 

10 ducted "on site" or such analysis may be performed at a 
central processing facility using data collected by the 
portable data equipment. For example, vibration moni- 
toring is performed with hand-held portable data collec- 
tors which include a probe placed against a machine 

, 5 and held in contact with the machine for a few seconds 
in order to measure vibration spectral characteristics. 
Certain spectral signatures would imply that a specific 
bearing is wearing out and should be replaced. The 
monitoring may be performed on a daily, monthly, or 

20 even a quarter-annually basis. Among the many draw- 
backs of such a monitoring scheme, is that it is labor 
intensive. Furthermore, the expert diagnostic analysis 
that can be performed on the data is limited because 
only a brief "snap shot" of the data is obtained for anal- 

2 5 ysis. In addition, such a method does not take into 
account equipment loading nor environmental variables 
such as ambient temperature. 

[0005] Examples of such vibration detection and anal- 
ysis systems are shown in U.S. Patents 4,520.674 and 
30 4,985,857. These systems all suffer the drawbacks 
noted above relative to the predictive maintenance sys- 
tems, including excessive labor and too early replace- 
ment or maintenance action. 

35 QFUECTS AND 'NUMMARY O F THE INVENTION 

[0006] It is an object of the present invention to pro- 
vide a method and apparatus for performing pre-emp- 
tive maintenance on operating equipment that can 
40 eliminate the above-noted defects inherent in the previ- 
ously proposed systems. 

[0007] Another object of the present invention is to 
provide an improved maintenance system in which 
maintenance is performed based upon an analysis of 
45 detected physical parameters of the operating equip- 
ment. 

[0008] A further object of the present invention is to 
initially perform a learning operation using detected 
information from the operating equipment so that knowl- 

so edge can be gained for use in determining the future 
indications of failure or out of tolerance performance. 
[0009] The invention provides a system according to 
Claim 1. Preferred embodiments are described in 
Claims 2-10. The invention also provides a method 

55 according to Claim 11. Preferred embodiments are 
described in Claims 12-18. 

[0010] In accordance with an aspect of the present 
invention, the operating equipment is operated to gener- 
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ate a set of intrinsic physical signatures that are prod- 
ucts of the primary performance characteristics of the 
operating equipment. These^signatures are nonfunc- 
tional parametrics that are consistent and reliable indi- 
cators of normal operation of the equipment. By 
carefully detecting and analyzing these physical param- 
eters one can confirm an equipment and/or process 
operational status relative to normal or abnormal opera- 
tion. Furthermore, by processing the parametric data, it 
is possible to provide indications of future failure or 
impending out of tolerance performance of the opera- 
tional equipment. 

[001 1 ] In order for the present invention to determine 
that the operating equipment is performing within its so- 
called norms and to then detect significant deviations 
from normal, an initial learning mode is selected and a 
learning operation is performed. The present invention 
detects data from the operating equipment at periodic 
intervals within a one hour time period and forms statis- 
tical norms at the end of each hour for each sensor and 
for several frequency bands of vibration in regard to the 
vibration sensor. During the hourly calculation opera- 
tion, dependent norms in the form of average value cor- 
relations are obtained for each vibration frequency band 
with respect to temperature and load. Independent 
norms are also obtained with respect to the ambient to 
unit temperature difference and the electrical current to 
load ratio. The intervals of the data acquisition during 
she monitor mode are identical to the periodic data 
intervals in the learn mode described above. Hourly cal- 
culations are also performed in monitor mode in a simi- 
lar manner to those of learamode. An additional failure 
probability calculation is performed in monitor mode 
which compares the learn mode norms with the monitor 
mode data obtained each hour, and the potential for fail- 
ure is estimated. 

[0012] The above and other objects, features, and 
advantages of the present invention will become appar- 
ent from the following detailed description of illustrative 
embodiments to be read in conjunction with the accom- 
panying drawings in which like reference numerals rep- 
resent the same or similar elements. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0013] 

Fig. 1 is a schematic in block diagram form of an 
overall system according to an embodiment of the 
present invention; 

Fig. 2 is a block diagram'showing the system of Fig. 
1 in more detail; 

Fig. 3 is a schematic in block diagram form of the 
monitoring unit used in the system of Fig. 1 ; 
Fig. 4 is a flow-chart showing the operations of 
executive routine performed by the processor used 
in the unit of Fig. 3; 

Fig. 5 is a flow-chart showing the operations of the 



raw data acquisition operations of the processor in 
the unit of Fig. 3; 

Fig. 6 is a flow-chart showing the operations of the 
hourly calculations of the processor in the unit of 
5 Fig. 3; 

Fig. 7 is a flow-chart showing the operations of the 
learning mode calculations of the processor in the 
unit of Fig. 3; 

Fig. 8 is a flow-chart showing the operations of the 
w monitoring mode calculations of the processor in 
the unit of Fig. 3; 

Fig. 9 is a diagram of the operation of the neural 
processor; 

Fig. 10 is a sample of a subsystem overview dis- 
15 play; 

Fig. 1 1 is a sample of a detailed status display; and 
Fig. 12 is a sample of a display for each status bar 
in the display of Fig. 10. 

20 DETAILED DESCRIPTION OF PREFERRED EMBOD- 
IMENTS 

[0014] The present invention provides a system utiliz- 
ing specific electronic equipment following predeter- 

25 mined operating programs that determines the future 
general condition of a machine. More specifically, pre- 
emptive maintenance is a system for performing main- 
tenance and process assurance on run-critical equip- 
ment, which is equipment that must operate in order so 

30 assure a process and that if otherwise interrupted would 
cause significant environmental or financial damage. 
The system of pre-emptive maintenance utilizes intrin- 
sic health monitoring to discover impending 
machine/process failure; uses expert diagnostic testing 

35 to determine the specific failure that is going to occur; 
and uses data from a reference library to apply the 
appropriate maintenance action in order to keep the 
process operating without interruption. . 
[0015] Fig. 1 represents the functional elements of the 

40 above-described system, wherein a processor forms a 
central station 2 that sends out a request for data on line 
3 to an intrinsic health monitor system 4 and receives 
failure probability alarms on line 4*. That is. the central 
station 2 is electronically linked to the intrinsic health 

45 monitor system 4 and receives failure probability status 
reports on demand which are estimates of the so-called 
health of the equipment being monitored. The central 
station 2 can be in the form of a PC-based computing 
system that includes a diagnostic testing library 6 so 

so that the processor 2 directs the diagnostic testing to be 
performed by a request on line 5 and the test results are 
fed back on line 6' to the central station 2 to identify the 
failure. The processor 2 can then interrogate a refer- 
ence data bank 8 that includes a library of possible 

55 maintenance actions for the appropriate, work order for 
the impending identified failure by sending a request for 
a comparative match on line 7 and then receiving the 
pattern match characteristics on line 8'. The processor 2 
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then directs the maintenance action 10 that can include 
printing a work order for the maintenance personnel or 
purchasing department and identifying the replacement 
parts and supplies for the equipment being monitored 
by sending commands line 9. 

[0016] A more detailed view of the overall system is 
shown in Fig. 2, wherein one unit of a typical operating 
equipment or machine 12-1, which may be a turbine 
engine, a pump, motor, fan, compressor or any other 
machine-driven apparatus, is shown. Nevertheless, the 
present invention is intended to function within an over- 
all operating system including a number of units of oper- 
ating equipment, so that a number of different machines 
can be maintained following the present invention. For 
example, another unit of operating equipment 12-2 may 
be monitored as well, and the number of machines is 
generally unlimited up to 12-n. The operating parame- 
ters of the operating equipment 12-1 are sensed by a 
number of sensors 14-1 which may include a vibration 
sensor, a sensor for determining the machine load, a 
sensor for sensing the machine temperature, a sensor 
for sensing the electrical current being drawn by the 
operating equipment, and a sensor for sensing the 
ambient temperature. All of the various signals detected 
by these several sensors 14-1 are fed through a sensor 
interface unit 16-1 to a monitoring unit 18-1. which is 
shown in more detail hereinbelow. As will be described 
below, monitoring unit 18-1 provides local information in 
the form of a status display 20-1 to the operator on site 
at one location of the operating equipment 12-1 . In addi- 
tion, the monitoring unit is connected to a central work 
station 22 via lines 24 that may comprise conventional 
telephone lines. Located in the same location as the 
■- central work station 22 is a remote status display 26 for 
displaying the status of the operating unit 12-1 , 12-2, ... 
12-n. 

[001 7] Only one central work station 22 is required to 
support a number of monitoring units 18-1, 18-2. ... 18- 
n, which are the primary data gathering elements in this 
embodiment. Each monitoring unit 18-1. 18-2, ... 18-n 
gathers data by its separate sensor array 14-1 , 14-2, ... 
14-n and transmits the data over the telephone lines 24 
in this embodiment to the central work station 22. The 
data is gathered periodically, typically in ten to fifteen 
minute intervals, and then the monitoring unit 18-1 proc- 
sses the periodic data readings and provides an 
appraisal of the equipment status, that is, normal or not 
normal. This equipment status is provided on the local 
display 20-1 and is also transmitted on demand to the 
central work station 22. The hourly condition evalua- 
tions are formed locally in the respective monitoring unit 

■ 18-1 . 18-2. ... 18-n that include local processors, as will 
'be shown in Fig. 3 and. thus, require no further process- 
ing at the central work station 22. The central work sta- 

' tion 22 maintains two separate data bases, one for 
equipment status displays based on the hourly condi- 
tion evaluation data and the other for periodic data read- 
ings for subsequent analysis using predictive 



maintenance techniques. 

[0018] The details of the monitoring unit 18-1 of Rg. 2 
are shown in Fig. 3, in which an operating equipment 
unit or machine 12-1 is connected to be monitored by a 

5 primary variable sensor 40. such as a vibration sensor 
that provides primary intrinsic machine data, an electric 
current sensor 42 that provides independent Intrinsic 
data, and a load sensor 44 that provides information 
concerning the load of the machine 12-1 and may com- 

1 o prise a speed , strain, or pressure sensor thereby provid- 
ing dependent machine intrinsic data. An environment 
sensor 46 senses the temperature of the machine 12-1 
and, thus, provides additional dependent intrinsic data. 
In addition, the ambient temperature is detected by an 

15 ambient temperature detector 48. 

[0019] The data signals provided by the sensors 40, 
42, 44, 46, and 48 are processed by two different signal 
interface modules comprising the sensor interface 16-1 
of Fig. 2. A signal interface conditioner module 50 pro- 

20 vides an interface for vibration and current intrinsic data, 
as provided by the primary variable sensor 40. which 
detects vibration, and the independent variable sensor 
42, which detects machine current. The signal interface 
conditioner module 50 is provided with a reference volt- 

25 age from a reference voltage power supply 52 that pro- 
vides an accurate reference voltage for use in 
determining the data from the vibration arid current sen- 
sors 40. 42. A more general interface module 54 pro- 
vides an interface for the ambient temperature variable 

30 sensor 48 and the machine temperature or environment 
variable sensor 46, as well as for the load variable sen- 
sor 44. The general interface module 54 does not 
require a separate reference power supply to produce 
the appropriate reference voltages arid signal process- 

35 ing. General interface module 54 includes an internal 
analog-to-digital conversion circuit, su thafttieVeceived 
analog data is supplied to a control processor unit 56 via 
a digital data/control bus 58. 

[0020] Sensor data processed by the signal interface 

40 condition module 50 is transmitted to a programmable 
gain amplifier/attenuator unit 60 that adjusts the array 
data signal for amplitude and desired resolution. The 
analog vibration data is supplied on line 62 to a pro- 
grammable bandpass filter 64. The analog electrical 

45 current data is transmitted on line 66 to an analog-to- 
digital convertor 68. which also receives the ''analog out- 
put signal on line 70 from the bandpass filter module 64. 
Alternatively, both the current data and the vibration 
data can be bandpass filtered before being converted to 

so digital data. The programmable bandpass filter module 
64 provides high and low passband separation of the 
received data, thereby providing an anti-aliasing func- 
tion so that the energy content within each band rhay be 
processed and stored separately. It is contemplated in 

55 this embodiment that five separate bands are provided 
for separate processing over a continuous total band- 
width of 5 through 5,000 Hz. These five bands might be 
from 5-50 Hz, 50-200 Hz. 200-500 Hz, 500-2,000 Hz. 
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and 2,000-5,000 H2, respectively. Not only are these 
bands presented as typical values but. in fact, the bands 
need not be continuous over the data spectrum. 
[0021 ] In any event, once the data has been converted 
to digital data in the analog-to-digital converter 68. it is 
placed on the system bus 58 and supplied to the control 
processor 56 that controls the data acquisition process 
using a method described hereinafter. 
[0022] The control processor 56 has a dedicated res- 
ident memory 70 that stores all data fed thereto and 
also stores the operating program for the processor 56. 
In addition, the monitoring unit 18-1 can also include a 
so-called portable memory 72 that can be a floppy disc 
memory or the like, as well as a network module 74 and 
a communication module 76. which function to permit 
communication with the central work station 22 over the 
telephone lines 24, as shown in Fig. 2. 
[0023] As will be described hereinafter, the control 
processor 56 performs a learning process as repre- 
sented by one or more neural processors 78. The con- 
trol processor 56 also performs data analysis in the 
monitoring mode. 

[0024] The intrinsic health monitoring system operat- 
ing process is shown in Fig. 4 and consists of an execu- 
tive process and a number of special purpose routines, 
which perform sensor data acquisition, hourly average 
calculations, learn mode status calculations, and moni- 
tor mode status calculations. 

[0025] The executive process performs an initializa- 
tion routine at 80 upon start-up and also services 
input/output requests whenever operating time is availa- 
ble. The intrinsic health monitor system executive proc- 
ess of Fig. 4 is a "Ring-Commutator" program; it is not 
interrupt driven. The executive process sequences from 
task to task performing each routine in a fixed order 
based on the system master clock and controls when a 
raw data read operation 82. which corresponds to sen- 
sor data acquisition, will occur as represented at deci- 
sion step 84. The system master clock also forces the 
executive process to perform hourly average calcula- 
tions. The clock is set to initiate an executive commuta- 
tion sequence every 7.5 minutes and when a sequence 
is initiated the executive process performs the raw data 
read (RDR) routine 82 and after completing the RDR 
the executive process checks in step 86 to see if the 
hour counter is equal to eight, which is the indication to 
perform the hourly average calculation (HCR) routine 
88. It will be. noted that 7.5 x 8 - 60 minutes. If the hour 
counter decision 86 results in a total less than eight, the 
executive process returns to an "idle mode" as repre- 
sented by the path 90 and waits for either the next mas- 
ter clock prompt or it services input/output requests. If 
an hourly average calculation routine was performed at 
88. the executive process checks the system mode key 
in step 92 to see which of the applicable status routines 
should be performed, either the learn mode status cal- 
culations 94 or the monitor mode status calculations 96. 
both of which are described in detail hereinbelow. When 



either status routine 94 or 96 is completed the executive 
returns to idle mode 90. If any request for input/output 
services is received, as represented at step 98. during 
the data acquisition, status, or calculation routines, the 

5 request is not performed at operation 1 00 until the exec- 
utive process returns to the idle mode 90. Input/output 
requests are held in the communications module buffer 
until serviced during the next idle period. It has been 
found that the amount of time generally available for I/O 

70 service during a 7.5 minute time period is approximately 
4 minutes. 

[0026] The raw data read routine 82 of Fig. 4 (RDR) is 
shown in detail in Fig. 5 and operates to execute the 
acquisition and storage to memory of the sensory data 

/5 for the machines that are being monitored. The primary 
task of the raw data read routine 82 is to build a raw data 
(RD) table of sensor values for each parameter that is 
being monitored. The RDR routine 82 executes thirty 
readings from each non-vibration sensor and one hun- 

20 dred fifty (5 x 30) readings from each vibration sensor. 
In the initialize step 1 10, the number of machines (MN), 
the desired number of readings (RN), and the desired 
number of frequency bands (BN) are entered. Next, in 
step 112 the system configuration comprising the sen- 

25 sors and machine assignments are set and the amplifi- 
ers (not shown) are set as well. Also in step 112 the 
machine count, band count, and read count are all set to 
zero. In step 1 14 the amplitude for the pulse amplitude 
modulation and the band for the pulse frequency modu- 
lo lation are set and the analog-to<iigital converters are 
set for the system configuration. In step 116 the 
machine count and readings count are read and the 
vibration data for the particular band is read. After the 
vibration sensor data is read in step 11 6 it is checked in 

35 step 1 18 whether the band count is equal to the desired 
number of bands and, if not, the band count is incre- 
mented in step 120 and steps 114 and 116 are 
repeated. If all of the desired bands have been read, as 
determined in step 118. the sensors other than the 

40 vibration sensors are read in step 122. 

[0027] Next, in step 1 24 it is checked whether there 
are any OFF flags or stability flags set. and then a tem- 
perature table is formed from the data read in step 122. 
It is then checked in step 126 whether the readings 

45 count has reached the desired number and. if not. the 
readings count is incremented in step 128 and the pre- 
ceding steps are repeated. If the desired number of 
readings, as determined in step 126. has been reached 
a machine reading table is formed and stored in step 

so 130 and the machine count is incremented. Next, it is 
determined whether all the machines in the operating 
system have been read by checking in step 1 32 whether 
the machine count equals the number of machines 
being monitored and. if not. all of the preceding steps 

55 are repeated. If all of the machines to be monitored 
have been read out, the raw data read routine is exited. 
[0028] In checking for the off condition and for stability 
in step 124, if the current flow equals zero that means 
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that the machine is off, and the off flag is set. If the load 
readings vary beyond established load limits then the 
stability flag is set. The stability flag permits data to be 
tagged so as not to contaminate valid norms during 
hourly calculations. A maximum of three load conditions 
may be established. 

[0029] Thus, each value in the RD table is an average 
value, obtained with a statistical confidence equal to 
95%. The routine performs two separate checks on 
each reading obtained from the system sensors. It 
checks that the machine being monitored is running and 
that the machine loading is stable during the raw data 
read period. If either condition exits, that is, the machine 
is off or the load changes during the data acquisition, 
the data will be invalid. If the machine is in the off condi- 
tion during any reading, flag-A is set; if the load has 
changed more than ± 2%, flag-B is set. RD readings 
that have either flag set are not used in any further cal- 
culations, but they are stored in system memory. 
[0030] The hourly average calculation routine (HCR) 
shown in Fig. 6 is initiated after every eight operations of 
the raw data read routine. The task of the hourly aver- 
age calculation routine is to form hourly averages of the 
RDR data and to build the hourly reading (HR) table. 
The HR table data taken each hour is the data that is 
used for the learn mode and the monitor mode status 
determination. An additional task for the HCR of Fig. 6 
is to calculate certain parametric relationships such as 
the load to current ratio LC and the ambient to unit tem- 
perature difference ( T u - T A = TL ) . The HCR also clas- 
sifies RD data into load types, which allows all other 
data to be correlated to the load condition that was 
present during data acquisition. It will be appreciated 
that each load condition produces a different set of par- 
ametric norms. 

[0031 ] In the hourly average calculation routine shown 
in Fig. 6, the routine is initialized in step 140 in which the 
load count, the machine count, and the read count are 
set to zero. Next, in step 142 the raw data RD for the 
machine count and read count are fetched. There will 
be eight RD data for each machine. It is checked in step 
144 whether an OFF flag is present in the data and if it 
is set the data is invalid and will not be processed. In 
step 146 it is checked whether a stability flag has been 
set in the data and if it is set the data is not processed. 
If no such flags are found, the machine load at that data 
time is classified and limits ace calculated in step 148. 
The load count LC and machine read counts AMR, 
BMR, CMR are updated and the data is stored in the 
temperature table in step 148 as well. The machine 
read counts AMR, BMR. CMR represent the number of 
reads acquired for each of three load conditions. Next, it 
is determined in step 150 whether the read count RC 
has reached the desired number RN, here it is five, and, 
if not. the read count is incremented in step 152 and the 
raw data fetching step of 1 42 is repeated. If the desired 
number of readings as determined in step 150 has been 
reached then the machine count MC is incremented in 



step 154 and it is checked in step 156 whether all the 
machines have been read. If not, the raw data read step 
142 is repeated. If all of the machines have been read, 
then in step 158 the temperature table is examined, the 

5 hourly reading table (HR) is formed and stored, and the 
hourly calculate routine is exited. 
[0032] If a machine off flag was found in step 144 or 
an instability flag in step 146, it is checked in step 160 
whether the read count has reached the desired 

10 number and, if not, the read count is incremented in 
step 162 and then MRD is incremented in step 164. 
MRD represents the number of unusable readings. If 
the read count has not reached the desired number 
then the machine count is incremented in step 166 and 

is step 164 is next performed. Following step 164 it is 
checked in step 168 whether the machine count has 
reached the desired number and. if so, step 158 is per- 
formed and. if not, the preceding steps are repeated, 
starting with step 142. 

20 [0033] Thus, it is seen that a special feature of the 
HCR is that it will not form an hourly average unless at 
least five valid RD reads of the same machine load con- 
dition exist. The HCR will save load type data until addi- 
tional RD reads, acquired in subsequent hours, fills the 

25 requirement of five. In this manner the HCR builds the 
HR tables for each machine operational hour and stores 
them in system memory. If an hourly average is not cal- 
culated, the average from the previous hour is used. 
[0034] The learn mode status calculation routine 

30 (LSR). shown in detail in Fig. 7, performs an evaluation 
of the stored HR data tables obtained by the process 
explained in connection with Fig. 6. The LSR task is to 
determine if the acquired data constitutes a stable basis 
for normalcy. It is a premise of intrinsic health monitor- 

35 ing, as described herein, that normalcy consists of a set 
of stable relationships between the vibrational energy in 
all frequency bands emanating from an operating 
machine and machine load, electrical current, and unit 
temperature. The LSR calculates a twelve hour average 

40 and forms a 95% confidence interval, wherein the upper 
and lower limits of the confidence intervals are 
expressed as: 

X u = X + t^;X L = X-t^ (1) 

45 

where s equals the deviation of the average and t 
equals the percentage of the data distributions for 
eleven degrees of freedom. X is the value of the mean 
and n is the number of hourly averages used (12). 

so [0035] In the learn mode status calculation mode 
shown in Fig. 7. it is first checked in step 180 whether 
the hourly count is greater than eleven, that is, whether 
the hourly count has reached twelve. If so, then in step 
182 the twelve hour learn mode profile is calculated 

55 from the last twelve hours data. Then, using the equa- 
tions in (1) above the 95% confidence intervals for all 
machines and all loads for the first band are calculated 
in step 184. Next, it is determined in step 186 whether 
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the number of twelve-hour values in the 95% confidence 
interval equals the desired number of all machines and 
loads. Then, in step 188 the band stability counters are 
checked to determine whether the count therein repre- 
senting data lying between the calculated limits is less 
than or equal to four, in which case an equal weight, for 
example, five, is subtracted. On the other hand, if the 
count number in the band stability counter is greater 
than or equal to seven, then an equal weight, for exam- 
ple, five, is added to the band stability counters for all 
machines. Following this weighting operation, it is 
checked in step 190 whether any band stability count 
has reached one hundred and, if so, the appropriate 
machine/load monitor keys for the band under consider- 
ation are set in steps 192. If the one hundred count has 
not been reached, it is checked in step 194 whether the 
data in all five bands being considered have been 
learned and if not the band number is incremented in 
step 196 and steps 186 and 188 are repeated. If all five 
bands have been learned then it is checked in step 198 
whether the monitor keys have been set and. if so. in 
step 200 the mode control key is set to monitor and the 
confidence interval is stored. The learn mode status cal- 
culate routine is then exited. On the other hand, if the 
monitor keys are found in step 1 98 not to be set the rou- 
tine is exited, as well. 

[0036] Thus, it is seen that the LSR calculates the 
confidence interval for all bands of all load conditions for 
all of the machines being monitored. The LSR deter- 
mines the number of data points which are between X u 
and X L inclusively, for each confidence interval being 
calculated, in order to quantify the data distributional 
stability If seven or more data points are within the con- 
fidence interval, a mathematical weight is added to a 
stability counter, if less than five data points are within a 
confidence interval an equal weight is subtracted from 
the counter. For a count of five or six no value is added 
or subtracted. When the count of all the band stability 
counters (BSC) for any machine equals one hundred, 
then the machine is considered to be running normally 
and a basis for subsequently performing the monitor 
mode has been established. The last function of LSR is 
to change the system mode control key from the learn 
mode to the monitor mode if all the band stability 
counters for any machine are indicating sufficient stabil- 
ity When the mode key is set monitor for any machine, 
the last calculated confidence interval is saved in sys- 
tem memory for the monitor mode status calculations. 
[0037] The monitor mode status calculation routine 
(MSR). shown in detail in Fig. 8. performs an evaluation 
of each machine that is being monitored by comparing 
the present hourly average data in the HR table to the 
confidence intervals calculated in the learn mode. 
[0038] In the monitor mode calculate status process 
shown in Fig. 8. the first step 210 is to fetch present 
hourly data and the previous hourly data from the HR 
table. Then, in step 212 the machine count is gotten 
from the configuration table and the machine count. 



load count, and band count are set equal to one. Also in 
step 212 the confidence intervals CI for all machines, 
loads, and bands are fetched from memory and the 
moderate limits (L^) and extreme limits (L E ) are calcu- 

5 lated from the high and low confidence intervals. In step 
214 the specific L M and L E values for the machine 
count, load count, and band count are acquired and a 
failure probability counter FP (L j (B ) for this machine load 
and band is set to zero. Then, in step 216 the present 

io hourly data is checked against the moderate limit L M 
and if it exceeds the moderate limit the.neural processor 
subroutine is entered. More specifically, in the neural 
processor subroutine step 218 sets the failure probabil- 
ity counter FP (L ^ B) for the particular band of the 

is machine equal to four. In step 220 the previous hourly 
data is compared to check if it exceeds the moderate 
limit Lm and, if so. the failure probability counter FP (L)(B) 
is set to the previous value plus four. In step 222 the 
present hourly data is checked to see if it exceeds the 

20 extreme limit L E and. if so, the failure probability counter 
FP (D(B) ts se * t0 tne previous value plus eight. In step 
224 the previous hourly data is checked to see if it 
exceeds the extreme limit and, if so. the failure probabil- 
ity counter is set to its previous value plus four. Next, the 

2 5 band count is checked in step 226 and if it is equal to 
five, which is the total number of frequency bands being 
checked in this embodiment, then the load count is 
checked in step 228. If the load count reflects that all 
load conditions for this machine have been processed, 

30 which is three in this embodiment, then in step 230 the 
weighted probability for machine failure is calculated for 
each machine by summing all of the counts in the failure 
probability counters derived in the statistical -processing 
loop. If the machine count which is checked in step 232 

35 indicates that all the machines in the system have had 
their data processed then ail this calculated failure prob- 
ability data is submitted to a neural process represented 
at step 234 consisting of a fuzzy inference operation 
which combines the dependent and independent data 

40 into one conclusion for each machine. These conclu- 
sions are stored in system memory and the systems 
display, described in detail hereinbeiow, is updated in 
step 234. The value M X V L is the total vibration for a spe- 
cific machine and load. Following the display updating, 

45 the monitor mode calculate status routine is exited. If in 
step 226 it is determined that ail of . the bands had not 
yet been checked, the band count BC is incremented in 
step 236 and the preceding steps are repeated. Simi- 
larly, if in step 228 it is determined that all of the load 

so conditions for each machine had not been processed, 
the load count LC is incremented and the band count is 
reset in step 238 and the preceding steps repeated. 
Similarly, if in step 232 it is determined that all of the 
machines had not been processed then in step 240 the 

55 machine count is updated and the band and load counts 
are reset. If in step 216 the hourly data is found not to 
exceed the moderate limits then the failure probability 
for a single band value for a given load FP (L)(B) is set to 



7 



13 



EP0 908 805 A1 



14 



zero in step 240. 

[0039] Thus, in order to accomplish an evaluation, the 
MSR utilizes the HR table data for the present hour (N), 
the previous hour (N-1) and the individual confidence 
intervals for each vibration band, for a particular load 
condition. The MSR calculates a moderate limit (± LM) 
and an extreme limit (±LE) from the high (X u ) and low 
(X L ) confidence intervals and submits the H x , H^, L M 
and L E data to the statistical processor sub-routine that 
processes the data for each vibrational band of each 
machine load condition and calculates a weighted prob- 
ability for potential machine failure. If the failure proba- 
bility counter for any machine load condition is equal to 
one hundred, then the load condition failure probability 
is maximum. Each machine has its own counters for 
each of its load conditions which counters contain the 
sum of its individual vibration band counters. Each band 
counter can contribute a maximum of 20%, (five bands 
x 20 = 100) or any lesser amount to a lower limit of zero 
The final task of the MSR is to submit the load condition 
failure probability counts, the independently derived dif- 
ferential temperature T c and load to current ratio data 
(UC) to the neural processor and store the resultant 
hourly status, that is. the failure probability, in system 
memory and to update the video display of the failure 
potential of each machine. 

[0040] As shown in Fig. 9. the neural process applies 
antecedent block logic to each set of data produced by 
the statistical processor. In step 310 the consequent 
block logical product is the difference between the 
present hourly vibration data (MVL) and the learned 
norms (LVL) for the various load conditions of a 
machine. Similarly, in step 320 the consequent product 
is the load to current ratio difference between the 
present hour (MLC) and the learned norms (LLC) and in 
step 330 the product is the difference between the dif- 
ferential temperatures observed in the learn mode (LTL) 
and the present hour (MTL). In step 340 the logical sum 
is formed by dividing each product by three, which is the 
number of load conditions permitted in this embodi- 
ment, and forming a final failure probability sum. Note 
that any value above 100 is considered to be equal to 
100. which is the maximum failure probability value. 
[0041 ] Thus, it can be seen that a value of 1 00 can be 
achieved by the vibration data alone, with no contribu- 
tion from the load to current ratio or the differential tem- 
perature, or that small vibrational differences from 
normal can receive large value contributions from both 
the load ratio and the temperature differential in order to 
generate a maximum failure probability. 
[0042] As described above, a feature of the present 
invention is a display that may be both a remote display 
at the central status display unit, as well as a local dis- 
play for the operator of the particular operating equip- 
ment. A typical subsystem overview display is shown in 
Fig. 10. If it is assumed that the operating equipment is 
a fuel combustion boiler system, then it is seen in Fig. 
10 that the various functional elements of the operating 



systems, such as the fans, pumps, and the like are suit- 
ably labeled. Then an indication is provided that a prob- 
lem either is likely or is actually almost in a failure mode. 
For example, in regard to the forced draft fans, an illurrri- 
5 nated display 360 indicates that the failure probability is 
33%. whereas in regard to the feed water pumps, an 
illuminated display 362 indicates that the failure proba- 
bility is 90%. It is contemplated that all of the displays 
shown typically at 364 in which the failure probability is 
10 zero or less than 33% would be illuminated with green 
color, whereas in the case of the display 360, the display 
would be illuminated with a yellow color and in the case 
of the eminent failure shown in display 362, the display 
would be illuminated with a red color. 
is [0043] Fig. 11 shows a format of a status report as 
might be displayed on the central status display unit of 
Fig. 2 and in which each line 370 represents a condition 
presently being monitored. In each line 370 there is a 
normal range or band 372 that indicates the normal 
20 range of operation for that particular parameter. In addi- 
tion, the present status is represented by a bar 374 that 
may fall within the range of the normal status indicator 
372. Thus, this shows the current status as it relates to 
the overall range of normal operation. In addition, at 
25 location 376 a statement is provided concerning the 
system status and probability failure. In this example, 
because all of the present indicator bars 374 fall within 
their respective normal ranges 372. the probability of 
failure is zero percent. 
30 [0044] Another feature of the present invention is the 
ability to provide a display relating to a trend of operat- 
ing parameters, as well as a historical profile. Fig. 12 
represents typical displays that might be provided at the 
central status display unit in Fig. 2, in which parameter 
35 trends and a histogram are provided both for long term 
trends in an area 380 and short term trends in area 382. 
The parameters would, of course, be labelled so that 
the person monitoring the central display station would 
immediately know that, in fact, the vibration is increas- 
40 ing or the operating temperature is increasing or the 
like. 

[0045] Based upon the above description, it is seen 
that the present invention provides a pre-emptive main- 
tenance system that includes a central work station with 

45 a video display that is remote from a number of various 
industrial sites and which is connected to each site by 
telephone lines. The central work station may comprise 
a microcomputer and includes a keyboard, resident 
memory, a mouse, a modem, and the like. The video 

so display that provides various displays, such as shown in 
Figs. 10. 1 1 . and 12 may be a color monitor and is used 
to display the machinery status/sensor data that is 
acquired by the central work station over the telephone 
communication lines from the several distant transmit- 

55 ting stations located at the various industrial sites. The 
distant stations provide on-site monitoring of the indus- 
trial facilities, which typically consist of machinery that 
operates an industrial process. The system equipment 
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located at each cite includes the intrinsic health moni- 
toring unit, a local display unit, and a number of condi- 
tion sensors relative to each machine that is being 
monitored. If a number of monitoring units are located at 
a single industrial site, then one of the monitoring units 5 
will be a master unit and can be connected to each of 
the other monitoring units by a local area network. The 
master unit transmits all data to the local display as well 
as to the central work station. 

[0046] Each monitoring unit of the system according 10 
to the present invention is an identical hardware appara- 
tus that performs identical methods and that provides 
parameter monitoring of a number of machines for dis- 
play or transmission of display data to a central work 
station. Each monitoring unit includes a number of sen- is 
sor signal conditioner modules, as well as a reference 
voltage power supply, that receives the sensor inputs, a 
programmable amplifier module, a programmable band- 
pass filter, and a control processor module. In addition, 
in order to permanently retain the intrinsic parameter 20 
data each monitoring unit includes a resident memory 
which provides a permanent record of the data. 
[0047] Although the present invention has been 
described hereinabove with reference to the preferred 
embodiment, it is to be understood that the invention is 25 
not limited to such illustrative embodiment alone, and 
various modifications may be contrived without depart- 
ing from the spirit or essential characteristics thereof, 
which are to be determined solely from the appended 
claims. 30 

Claims 

1. A system for providing future failure probability 
information for operating equipment, comprising: 35 

a plurality of sensors arranged proximate the 
operating equipment and including a primary 
intrinsic parameter sensor, a dependent opera- 
tional load parameter sensor, a dependent 40 
operational parameter sensor, and an inde- 
pendent operational parameter sensor; 
monitoring means located at a same location 
as the operating equipment and connected to 
receive data signals produced by said plurality 45 
of sensors and including a control processor 
operating an artificial intelligence algorithm for 
processing said data signals initially according 
to a learning procedure and subsequently 
according to a monitoring procedure whereby so 
energy contents in discrete portions in the data 
signals produced by said plurality of sensors 
are evaluated and producing evaluation sig- 
nals; and 

work station means connected to receive said 55 
evaluation signals from said monitoring means 
and including a display for providing a visual 
display representing said evaluation signals. 



16 

2. A system as claimed in Claim 1 , wherein said mon- 
itoring means includes a second display located at 
the same location as the operating equipment for 
providing a visual display representing said evalua- 
tion signals. 

3. A system as claimed in Claim 1 or Claim 2, wherein 
said monitoring means includes: a reference power 
supply, and a signal interface unit connected to said 
primary intrinsic parameter sensor and to said ref- 
erence power supply for referencing a data signal 
from said primary intrinsic parameter sensor to a 
reference voltage from said power supply. 

4. A system as claimed in Claim 3. wherein said mon- 
itoring means further comprises a programmable 
attenuator/amplifier receiving an analog output sig- 
nal from said signal interface unit and a digital-to- 
analog converter for converting an output signal 
from said programmable attenuator/amplifier to a 
digital signal. 

5. A system as claimed in Claim 3 or Claim 4, wherein 
said signal interference means provides a digital 
signal based on the data signal from said primary 
intrinsic parameter sensor, wherein said monitoring 
means further comprises a digital system bus con- 
nected to said digital signal and wherein said con- 
trol processor is connected to said system bus for 
receiving said digital signal and performing a statis- 
tical analysis using said artificial intelligence algo- 
rithm to produce said evaluation signals. * 

6. A system as claimed in any of Claims 1 -5, wherein 
said monitoring means including a programmable 
bandpass filter receiving an analog data signal from 
said primary intrinsic parameter sensor for seg- 
menting the data signal into said discrete portions 
and producing respective outputs for said discrete 
portions, said outputs being used in said learning 
procedure for acquisition of a base energy distribu- 
tion and said outputs being used in said monitoring 
procedure for acquisition of a comparative energy 
distribution. 

7. A system as claimed in any of Claims 1 -6. wherein 
said monitoring means further comprises a signal 
interface unit having a digital-to-analog converter 
for converting at least a data signal from said pri- 
mary intrinsic parameter sensor to digital data for 
processing by said control processor. 

8. A system as claimed in Claim 7. wherein said mon- 
itoring means further comprises a neural processor 
connected to said control processor for performing 
said learning procedure using statistical analysis to 
establish a set of stable relationships between data 
from said primary intrinsic parameter sensor and 
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10. 



data from said dependent operational load sensor, 
said dependent operational parameter sensor, and 
said independent operational parameter sensor, 
respectively. 

A system as claimed in Claim 7 or Claim 8 wherein 
said control processor includes means for perform- 
ing a summary operating equipment failure proba- 
bility based on fuzzy logic. 

A system as claimed in Claim 9, wherein said con- 
trol processor operates according to an inference 
engine. 



11. A method of evaluating operating equipment to 
determine a maintenance requirement, comprising 
the steps of: 

sensing a plurality of operating parameters of 
said operating equipment, including sensing 
ambient temperature, equipment operating 
load, and vibration and producing respective 
data signals; 

performing a learning operation on the data 
signals using statistical processing; 
deriving a plurality of operating norms based 
on results of said statistical processing; 
utilizing statistical inference and fuzzy logic for 
analyzing the data signals based on the 
derived plurality of operating norms and pro- 
ducing a failure probability conclusion; 
transmitting the failure probability conclusion to 
a location remote from the operating equip- 
ment; and 

displaying the transmitted failure probability 
conclusion. 
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levels; 

and including the further steps of: 
determining data values within the upper and 
lower confidence levels evidencing correct sys- 
tem operation; 

performing monitoring mode calculations by 
comparing present raw data with the data val- 
ues evidencing correct system operation; and 
displaying to a user results of the step of com- 
paring. 

16. A method as claimed in any of Claims 11-15, 
therein the step of producing respective data sig- 
nals includes: 

forming sensed data into a raw data table; and 
repeating the steps of sensing and forming 
over a first set period of time to produce hourly 
data used in the step of performing a learning 
operation. 

17. A method as claimed in Claim 16 including the fur- 
ther step of checking the equipment operating load 
data signal for stability and setting a stability flag in 
the data if successive quantities of the equipment 
operating load data vary one from another more 
than a predetermined amount. 

18. A method as claimed in Claim 17 including the fur- 
ther step of checking data for the presence of the 
stability flag and, upon the presence of the stability 
flag, excluding the data having the stability flag from 
the produced hourly data. 



12. A method as claimed in Claim 11, wherein the step 
of transmitting includes transmitting the failure 
probability conclusion over one of commercial tele- 
phone lines, local area network, wide area network, 
cellular network, or satellite communications. 

13. A method as claimed in Claim 1 1 or Claim 12, com- 
prising the further step of performing anti-aliasing 
filtering on the data signals before the step of utiliz- 
ing statistical inference and fuzzy logic. 

14. A method as claimed in any of Claims 11-13, 
wherein the step of displaying includes determining 
within which one of a plurality of quantitative ranges 
the transmitted failure probability conclusion falls 
and displaying each quantitative range in a respec- 
tive different color. 

15. A method as claimed in any of Claims 11-14, 
wherein the step of performing a learning operation 
includes calculating upper and lower confidence 
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